This article describes a modified technique for enhancing noisy speech to improve automatic speech recognition\r\n(ASR) performance. The proposed approach improves the widely used spectral subtraction which inherently suffers\r\nfrom the associated musical noise effects. Through a psychoacoustic masking and critical band variance normalization\r\ntechnique, the artifacts produced by spectral subtraction are minimized for improving the ASR accuracy. The popular\r\nadvanced ETSI-2 front end is tested for comparison purposes. The performed speech recognition evaluations on the\r\nnoisy standard AURORA-2 tasks show enhanced performance for all noise conditions.
Loading....